Automatic Closed Caption Detection and Filtering in MPEG Videos for Video Structuring
نویسندگان
چکیده
Video structuring is the process of extracting temporal structural information of video sequences and is a crucial step in video content analysis especially for sports videos. It involves detecting temporal boundaries, identifying meaningful segments of a video and then building a compact representation of video content. Therefore, in this paper, we propose a novel mechanism to automatically parse sports videos in compressed domain and then to construct a concise table of video content employing the superimposed closed captions and the semantic classes of video shots. First of all, shot boundaries are efficiently examined using the approach of GOP-based video segmentation. Color-based shot identification is then exploited to automatically identify meaningful shots. The efficient approach of closed caption localization is proposed to first detect caption frames in meaningful shots. Then caption frames instead of every frame are selected as targets for detecting closed captions based on long-term consistency without size constraint. Besides, in order to support discriminate captions of interest automatically, a novel tool – font size detector is proposed to recognize the font size of closed captions using compressed data in MPEG videos. Experimental results show the effectiveness and the feasibility of the proposed mechanism.
منابع مشابه
Real Time Video Scene Detection and Classification
The VISION (Video Indexing for Searching Over Networks) digital video library system has been developed in our laboratory as a testbed for evaluating automatic and comprehensive mechanisms for video archive creation and content-based search, filtering and retrieval of video over local and wide area networks. In order to provide access to video footage within seconds of broadcast, we have develo...
متن کاملAutomatic Story Segmentation of Closed-Caption Text for Semantic Content Analysis of Broadcasted Sports Video
Sports videos can be characterized as a sequence of recurrent semantic story units. Storing sports videos in this story-unit-based form will lead to develop an intelligent content-based retrieval, browsing, and summarization system. The storage requires segmentation of videos and semantic understanding of each segment. Since transcribed broadcasted video speech, the closed-caption text, can be ...
متن کاملAutomatic Caption Localization in Compressed Video
ÐWe present a method to automatically localize captions in JPEG compressed images and the I-frames of MPEG compressed videos. Caption text regions are segmented from background images using their distinguishing texture characteristics. Unlike previously published methods which fully decompress the video sequence before extracting the text regions, this method locates candidate caption text regi...
متن کاملReal time video scene detection and classi®cation
The VISION (video indexing for searching over networks) digital video library system has been developed in our laboratory as a testbed for evaluating automatic and comprehensive mechanisms for video archive creation and content-based search, ®ltering and retrieval of video over local and wide area networks. In order to provide access to video footage within seconds of broadcast, we have develop...
متن کاملStory Segmentation of Broadcasted Sports Videos with Intermodal Collaboration
This paper investigates the problem of efficiently describing broadcasted sports videos for effective multimedia applications. Considering the sports videos as a sequence of recurrent semantic story units, we propose a method for segmenting the sports videos into the story units and attaching the closed-caption segments, which correspond to the story units, as the detailed descriptions. This pr...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- J. Inf. Sci. Eng.
دوره 22 شماره
صفحات -
تاریخ انتشار 2006